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Abstract 

The purpose of this paper is to highlight the central role that the time asymmetry of stability 
plays in feedback control. We show that this provides a new perspective on the use of doubly-infinite 
or semi-infinite time axes for signal spaces in control theory. We then focus on the implication 
of this time asymmetry in modeling uncertainty, regulation and robust control. We point out that 
modeling uncertainty and the ease of control depend critically on the direction of time. We also 
discuss the relationship of this control-based time-arrow with the well known arrows of time in 
physics. 



I. Introduction 

The origin and implications of the "arrow of time" is one of the deepest and least 
understood subjects of physics. The "arrow" is an intrinsic part of the world as we know 
it. Yet its emergence in thermodynamics and cosmology, from physical laws which are 
apparently impervious to it, remains a controversial subject [29]. At first sight, this subject 
may seem unconnected with the theory of feedback control. However, starting from the 
very basic fact that our notion of stability in the sense of Lyapunov is time-asymmetric, we 
argue that the "arrow of time" does have important implications on modeling and uncertainty, 
robustness of stability, as well as on the topology for the study of the dynamics of feedback 
interconnections . 

The circle of ideas that gave rise to this paper began in a short note published by the 
authors thirteen years ago [8]. There, it was pointed out that the doubly-infinite time axis 
presents some "intrinsic difficulties" for developing a suitable input-output systems theory — 
difficulties that are not present in the semi-infinite time axis setting. These difficulties are 
not mere mathematical technicalities. Rather, they relate fundamentally to the consistency of 
the theory of stabilizability across different frameworks. Subsequently, a number of papers 
were written which shed light on the problem [22], [23], [24], [15], [16], [17]. The present 
paper takes a fresh look and traces the origin of the "puzzle" to the arrow of Lyapunov 
stability, and then, explores the relevance of this arrow to the topology of dynamical systems 
and feedback theory. 

The relationship of the modern theory of dynamical systems with classical physics and 
thermodynamics is a developing one. A classical contribution by Nyquist and Johnson [28], 
[18] is a derivation of the electromotive force due to thermal agitation in conductors. In [4] 
the issue of irreversibility is treated from the point of view of stochastic control theory. More 
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recently [13] has sought to formalize classical thermodynamics in the mathematical language 
of modern dynamical systems (see also [5]). In [27] information flow and entropy have been 
studied in the context of the Kalman filter. In [31] it is shown that a linear macroscopic 
dissipative system can be approximated by a linear lossless microscopic system over arbitrary 
long time intervals. Our point of view here is influenced by [29] and is somewhat different 
to the above references in that our main goal is to highlight a time-asymmetry, point out its 
implications, and discuss its relationship to other well-known asymmetries. 

The present paper begins by providing a new explanation of the issues raised in [8] 
with regard to an input-output theory for the doubly-infinite time axis. In Section [III] we 
introduce the time-conjugation operator and discuss the implications of the time-arrow in 
optimal control problems. In Section|IV]we analyse the effect of the time-arrow on modelling 
uncertainty; we show that dynamical systems which are close in the usual sense, that a 
common controller can stabilise and give similar closed-loop responses for either, may not 
be close when the time-arrow is reversed. Then, in Section |V] we further illuminate the 
inherent time-asymmetry in our ability to control a dynamical system with two specific 
examples. These can be thought of as examples of time irreversible feedback phenomena 
(see Section IV-BD . In Section [VI] we briefly discuss the arrow of time in physics and its 
relation to the time-arrow of feedback stability. Finally, in Section IVIII we consider feedback 
loops with small time delays and discuss the contrasting effects of delays and predictors 
and the connection with the arrow of time. 



A. Input-output and Lyapunov stability 

We focus on finite-dimensional linear dynamical systems which, for the most part, are 
assumed to be time-invariant. The dimensions of input, state and output (column) vectors, as 
well as the consistent sizes of transformation matrices in state-space models, are suppressed 
for notational simplicity. The following result is basic and well-known, cf. [38, p. 52-53], 



Proposition 1: Let P be a linear time-invariant finite-dimensional system which is 
controllable and observable and is specified by 



with an initial condition x(0) = 0. Then y e £ 2 [0, oo) for all u e £ 2 [0, oo) if and only if 
the matrix A is Hurwitz. Moreover, if this condition holds, y is determined uniquely by 

y(s) = (C(sl - Ay x B + D)u(s), where * denotes the Laplace transform. 

Many variants and extensions of the result are familiar: signal spaces with different 
norms can also be used; there is a finite-gain property relating the £ 2 -norms of y and u; 
even with x(0) ^ the main equivalence in the proposition still holds. Here we would 
like to highlight the fact that the result establishes an equivalence between stability defined 
in terms of the forced response and stability defined in terms of the free response, i.e. an 



II. Time-asymmetry and stability 



[12, p. 82]. 



x = Ax + Bu, 
y = Cx + Du, 



(1) 

(2) 



equivalence between bounded-input/bounded-output (BIBO) stability and Lyapunov stability 
for a system operating on the positive time-axis. Asymptotic stability in the sense of 
Lyapunov is obviously a time- asymmetric concept since convergence of the state vector 
is required as t tends to PLUS infinity, starting from an arbitrary initial condition at t = 0. 
In itself, BIBO stability does not appear to have this asymmetry, yet it is implicit in the 
formulation of Proposition [Q 

To further illustrate the point we can write down the following obvious corollary of 
Proposition [H obtained by running time backwards from to — oo. By changing the support 
of the signal spaces from the positive half-line to the negative half-line stability defined 
through the forced response (BIBO stability) becomes equivalent to asymptotic stability in 
the sense of Lyapunov for the reversed time-direction as t tends to MINUS infinity. 

Proposition 2: Let P be a linear system as in Proposition Q] with x(0) = 0. Then 
y e C 2 (-oo, 0] for all u e C 2 (-oo, 0] if and only if the matrix -A is Hurwitz. 

We now turn to the situation where inputs and outputs may have support on the doubly- 
infinite time-axis. In this case the following holds, e.g. see [44, p. 101]. 

Proposition 3: Let P be a linear system as in Proposition QJ Then there exists y e 
C 2 (-oo, oo) for all u e £ 2 (-oo, oo) if and only if A has no imaginary-axis eigenvalues. 
Moreover, if this condition holds, y is determined uniquely by y(s) = (C(sl - Ay 1 B + 
D)u(s). 

We remark that Proposition [3] is the natural generalisation of Proposition \T\ when systems 
are viewed as operators. A linear system in Proposition \T\ becomes a multiplication operator 
on the Fourier transformed spaces. The operator is bounded if and only if the "symbol" 
(the transfer-function) belongs to H^, which under the controllability and observability 
assumption is equivalent to A being Hurwitz. On the double-axis a multiplication operator 
on the Fourier transformed spaces is bounded if and only if the symbol belongs to — 
which for rational symbols excludes only poles on the imaginary axis. 

In Proposition [3] there is no longer any relationship between a notion of BIBO stability 
and Lyapunov stability (in either time-direction). Clearly, both A and —A may fail to 
be Hurwitz. Since only the existence of some y E C 2 { — oo, oo) is required for a given 
u E C 2 (— oo, oo), and the free motion solutions of ([]]) are ignored, this is not surprising. 
Propositions Q] and [2] by contrast, establish a connection between BIBO stability and Lya- 
punov stability as t — > +oo (respectively, t — > — oo) without putting in explicit requirements 
on the free motion solutions. 

We now consider the feedback interconnection in the form of Fig. \T\ where P and C are 
linear systems. The existence of signals u h yj E {1,2}) in C 2 [0, oo) which satisfy the 
feedback equations for a given pair of external inputs u , y in C 2 [0, oo), for a given set of 
initial conditions, is a well-known and natural definition of stability in terms of the forced 
response. From Proposition [T] stability in this sense is equivalent to asymptotic stability in 
the sense of Lyapunov of the combined state- space (assuming minimal realizations for P 
and C and well-posedness). Again, BIBO stability inherits the required time- asymmetry 
from the asymmetry of the support interval [0, oo). 
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Fig. 1. Standard feedback configuration. 



It is apparent that the corresponding definition of BIBO stability for this feedback 
interconnection with £ 2 (— oo, oo) signals, generalising Proposition [3l will not correspond to 
a sensible notion of closed-loop stability. Indeed, we can easily check that a system P with 
transfer funtion P(s) = l/(s — 1) is "stabilised" by any of the controllers with C(s) = 2, 
C(s) = 0, or C(s) = — 0.5/(s + 1). (In conventional terms the controllers give closed-loop 
poles which are in the open left-half plane (LHP), the open right-half plane (RHP), and in 
both half planes, respectively.) 

We can summarize the points so far as follows. Stability is a time-asymmetric concept — 
the requirement of an asymptotic property as t tends to PLUS infinity defines a time 
arrow. If stability is defined by requiring bounded outputs in response to bounded inputs 
then a time arrow is not obviously implied. However, for signal spaces with support on a 
positive (resp. negative) half-line, the definition turns out to imply a positive (resp. negative) 
time arrow. On the other hand, a bounded-input bounded-output definition of stability for 
signals with support on the doubly-infinite time-axis does not define a preferred time arrow. 
Stable systems defined by bounded "multiplication operators" may be stable in the sense of 
Lyapunov in the positive time-direction, in the negative time-direction or in neither direction. 



B. The two-sided time axis and causality 

The fact that the doubly-infinite time axis causes problems for the analysis of stability 
and of stabilisation was pointed out in [8]. The explanation given there is consistent with 
that of Section HH but the overall argument was somewhat different. We now summarize the 
reasoning of [8]. 

Two systems P; (i = 1,2) defined by convolution operators were considered: 

/oo 
hi{t — r)u(r)dT = hi* u 
-oo 

where hi(t) = e t for t > and zero otherwise, and h 2 (t) = — e* for t < and zero 
otherwise, respectively. Each system has (double-sided Laplace) transfer function equal to 
l/(s — 1), but with differing regions of convergence. The first system is unstable and causal 
and the second is stable and non-causal (in fact anticausal) according to the usual definitions. 

When viewed on C 2 (—oo, oo), P 2 is a bounded operator and hence is a stable system in 
an input-output sense. On the other hand, it was shown in [8] that P x fails to be stabilisable 



on £ 2 (— oo, oo). This is a counterintuitive result since Pi is stabilisable in the ordinary way 
on any positive half-line. The proof that Pi fails to be stabilisable on the doubly-infinite 
time-axis reduces to the observation that the graph of Pi fails to be closed. 

It was also pointed out in [8] that the closure of the graph of P 1 coincides with the 
graph of P 2 . Once the graph is closed there appears to be no problem with stabilisation. 
But in closing the graph "anti-causal" trajectories are brought in which are inconsistent with 
the convolution representation of the system, so this was considered inadmissible. 

Another possible remedy discussed in [8] was to consider the underlying differential 
equation representations rather than the convolution representations. In fact both systems 
are defined by the same differential equation 

V = y + u. (3) 

More precisely, the trajectories of both Pi and P 2 satisfy this equation. In terms of "flow 
of time" thinking, P 1 appears to arise by solving this equation forwards in time while P 2 
is obtained by solving it backwards. This suggestion seems to make stronger the argument 
to consider Pi and P 2 to be the same system. But this was considered unnatural in [8] on 
the grounds that it appears to abandon any notion of causality, or that it leaves the direction 
of time undefined. 

The discussion of Section [TT] allows the difficulties pointed out in [8] to be explained 
in a new way. Let us suppose we are willing to accept the closure of the graph of Pi 
which makes it "stabilizable" on the double-axis in a bounded-input/bounded-output sense. 
As explained, Pi and P 2 can now be thought of as one and the same system defined by 
© — a state-space description as in CE© solved forwards or backwards as desired. Does the 
closure of the graph resolve the difficulty pointed out in [8]? The answer is no, since the 
notion of stability does not correspond to the usual notions. As is made clear by Proposition 
|3l the feedback system may turn out to be stable in a conventional sense in the forward, 
backward or neither time-directions. 

C. The work of Makila, Partington and Jacob 

A number of interesting observations and contributions have followed from [8] which 
we would like to comment on here. 

The fact that a causal system on the double-axis can have a non-causal closure has led 
to a study of "closability" and "causal closability" as questions in their own right. Makila 
[25] has shown that the lack of causal closability for the example of [8] extends to general 
L v spaces on the double-axis. Jacob and Partington [17] give general characterisations of 
the graphs of time-invariant systems and derive necessary and sufficient conditions for the 
closure of a closable system to be causal. In [22] Makila and Partington consider weighted 
£ 2 -spaces on the double-axis and show that, when signals have very rapid decrease to zero 
towards — oo, causal convolution operators may be closed operators. (So there is no issue 
of causality being lost due to the operation of closure.) 

On the question of stabilization on the double time-axis, Jacob [15] has made an 
interesting suggestion. We have seen already that closing the graph and applying the BIBO 



stability definition fails to recover the usual concept of stability. Jacob proposed that causality 
of the closed-loop operators of the feedback system be added as an extra requirement. Jacob 
showed that the resulting characterisation of stability agrees with the usual definitions for 
linear time-invariant systems. In the context of the present paper we can re-interpret this 
result by saying that the causality condition forces the positive time-arrow into feedback 
system stability. We can understand this as follows. In [17] it is shown that a closed linear 
time-invariant system is causal on £ 2 (— 00,00) if and only if the corresponding transfer 
function belongs to a certain Smirnov class. For finite dimensional systems this is equivalent 
to the transfer function having no right half-plane poles. Thus, in Proposition [31 if P is 
required to be causal, BIBO stability agrees with Lyapunov stability with the positive time- 
arrow. 

Makila and Partington in [22] make an interesting observation on the possible extension 
of Jacob's idea to the time-varying situation. They consider a causal, convolution operator 
derived from the underlying differential equation 

y{t) + a(t)y(t) = u(t) (4) 

where a(t) = — 1 for t < and a(t) = +1 for t > and point out that the closure of the 
£2 (—00, 00) graph of the convolution system is not the graph of an operator. Essentially this 
boils down to the fact that there are free motion solutions y(t) = ce~'*', u(t) = 0, where c 
is a constant, which can be approximated arbitrarily closely by elements of the graph. This 
raises the question of whether the approach of Jacob can recover a theory of stabilization 
which is consistent with the single-axis case. At the same time it is pointed out that the 
system is stabilizable in a Lyapunov sense by the feedback 

u(t) = -2y(t). (5) 

In the present context this example highlights the care that is needed in defining stability for 
time-varying systems, even in the conventional sense. The open-loop system © is Lyapunov 
stable in the forward time-direction for any initial condition specified at any time (either 
positive or negative), but not uniformly so. Incidentally, the same is true for stability in the 
backwards time-direction. With the feedback © in force the system becomes uniformly 
stable in the sense of Lyapunov in the forward time-direction and unstable in reverse. In 
the perspective of the present paper, any method to force agreement between BIBO stability 
on the double-axis and conventional notions (such as requiring causality of the closed loop 
operators) might be seen as tantamount to directly imposing the desired time arrow within 
the stability definition. 

In several papers (e.g., [24], [22], [23]) Makila and Partington have advocated the use 
of a two-operator model for systems on the doubly infinite time-axis in the form Ay = Bu, 
where A, B are causal, bounded operators, in contrast to a single-operator model y = Pu, 
where P is causal and possibly unbounded. Closed-loop stability is defined as the existence 
of a causal, bounded inverse of the feedback system operator mapping system inputs to 
exogenous disturbances. Since this definition incorporates a causality requirement on the 
closed-loop system there is evidently a close relationship between this idea and the approach 
of Jacob. 



III. Time-Asymmetry and Optimal Regulation 



This section focusses on the time-asymmetry of the definition of stability and its im- 
plications in the context of optimal regulation. Firstly, a time-conjugation operator will be 
defined as well as the concepts of f-stability and b-stability. Then the finite-horizon quadratic 
regulator problem will be considered for a system running forwards in time and backwards 
in time, and it will be shown that the optimal cost is generally different. The infinite-horizon 
(asymptotic) regulator will also be considered in the same way. It will be shown that the 
optimal cost can be expressed in terms of the two extremal solutions of the appropriate 
algebraic Riccati equation. The result shows that ease of optimal regulation depends on the 
time-direction. 

A. The time-conjugation operator, f-stability and b-stability 

Let P denote a dynamical system described by the state-space equations in CE©, 
initialized at time zero and running forwards in time. Let Z denote the operation on P 
which corresponds to solving £Dl2]) backwards from t — followed by a flip of the time 
axis (so the new system runs forward again). More specifically we set t\ = —t, so that 

d __ d 
dt dt\ ' 
and then replace t\ by t which results in 

—x = Ax + Bu, with x(0) = x 
y = Cx + Du 

for the system 3(P). The effect on the transfer function is as follows: if P has transfer 
function P(s), then 3(P) has transfer function P(—s). 

Define the system P to be f-stable if A is Hurwitz, and define P to be b-stable if 
3(P) is f-stable, or equivalently, if —A is Hurwitz. It is immediately obvious that a linear 
time-invariant system of the type CD© can never be both f-stable and b-stable. Similarly, a 
controller which makes £Dl2]) f-stable cannot make it b-stable as well. 

B. The finite-horizon linear quadratic regulator 

Let P be a linear time-invariant system which is controllable and observable and de- 
scribed by CEH2K as before, with D = and x(0) = x . Consider the problem of the 
regulation of P with criterion 

J= t (y(t)'Qy(t)+u{t)'Ru(t))dt + x{T)'Hx(T). 
Jo 

This has solution 

u (t) = -R^B'S^xit) (6) 

where 

- S(t) = S(t)A + A'S{t) - SfyBR^B'Sit) + C'QC (7) 



and S(T) = H, with optimal cost J^t = x' S(0)xo [1]. When P runs backwards in time 
from x(0) = xo with cost 

J= ! (y(t)'Qy(t)+u(t)'Ru(t))dt + x(-T)'Hx(-T) 

J-T 

we can check that the optimal control is still given by © where S(t) satisfies © with 
S(—T) = —H, and that the optimal cost is J^t = —x' S(0)x . It can be readily verified 
that S(0) (forward case) is in general different from —S(0) (backward case), and so the 
optimal cost is different in the two cases, e.g., if A = B = C = Q = R = T=1 and 
H = 10, then 5(0) = 2.5415 in the forward case and -S(0) = 0.5495 in the backward 
case. 



C. The infinite-horizon linear-quadratic regulator 

Again let P be a linear time-invariant system which is controllable and observable and 
described by CD© with D = and x(0) = x . It is well-known [1] that 

POO 

J= (y(t)'Qy(t)+u(t)'Ru(t))dt (8) 
Jo 

has a minimum given by J f oo = x' S + x where S + is the unique positive-definite solution 
to the algebraic Riccati equation 

A'S + SA- SBR^B'S + C'QC = 0. (9) 

It is also well-known that S + is the unique solution of © for which A — BR~ l B'S has 
all its eigenvalues in the open LHP. In the language of the present paper we can say that 
S + is the unique solution of © which makes the system (H© f-stable with the controller 

u = -R- l B'Sx. 

What happens if we require the minimisation of 

J= I (y(t)'Qy(t)+u(t)'Ru(t))dt (10) 

J — oo 

for CD© running backwards in time? This is the same as the conventional problem for the 
system 3(P). It is easy to see that the minimum is given by Jb i00 = — ^o^Xo where 
is the unique negative-definite solution to ©. It is also well-known that S- is the unique 
solution of © for which A — BR~ l B'S has all its eigenvalues in the open RHP [44]. In 
the language of the present paper we can say that £_ is the unique solution of © which 
makes the system (d© b-stable with the controller u = —R~ l B'Sx. 

In general Jf j0O = x' S + x and Jb,oo = —x' S-X are different. This shows that "difficulty 
of control" is time-asymmetric for the standard linear-quadratic regulator on the infinite 
horizon. The difference can be significant, e.g. if A = 1, B = e, C = 1, Q = 1 and R = 1 

then S + = 2/e 2 + 1/2 + 0(e 2 ) and 5_ = -1/2 + 0(e 2 ) for e small. 



IV. Time-Asymmetry and Modelling Uncertainty 

In this section we look at the topology for uncertainty in feedback control and how this 
is affected by the time arrow. We will see that dynamical systems which are close in the 
usual sense, that a common controller can stabilise them and give a similar closed-loop 
behaviour, may not be close if time is reversed. 

A. The gap metric and robustness of stability 

Zames and El-Sakkary [43] introduced a metric on dynamical systems for the purpose of 
assessing robustness. This was based on the gap metric used in functional analysis to study 
invertibility of operators [19], [32]. Specifically, systems are considered to be operators on 
£2[0, oo) with a graph which is a closed subspace of £2(0, 00). Consider two linear systems 
Pi (i — 1, 2) with transfer functions 

Pi(s) = rii(s) (mi(s)y 1 

where n^s) and m^s) are coprime polynomials or, more generally, right-coprime polyno- 
mial matrices. Let 

(n i (-s)) T ni(s) + (m i (-s)) T m i (s) = (^(-s)) T d^s) 

with det (di(s)) a Hurwitz polynomial and ( ) T representing matrix transpose — the exis- 
tence of such a polynomial (matrix) di(s) is a standard result in the theory of canonical 
factorization [41]. Then, 

/miOOWs))- 1 ^ 
Gpi,H2 '■= \7~L2 ■= Gi(s)H 2 

V^sXdiOO)- 1 J 

is (the Fourier transform of) the graph of P i5 for i — 1,2. Thus, the graph symbol Gi(s) 
generates the graph of P; as its range. Then the gap between Pi and P 2 is defined to be 
8ft 2 (Pi, P2) := ||n<? p H2 — rig P2 H2 1| where 11^ denotes orthogonal projection onto a closed 
subspace /C. 

Let the feedback configuration of Fig. [Qbe denoted by [P, C], where P and C are linear 
systems defined as operators on £2(0, 00) which may possibly be unbounded. Define 

H P ,c:= (p) (I - PC)" 1 (I -C) 

to be the operator mapping ( Uq ) T to ( uj y[ ) T . The following are basic robustness 
results for gap metric uncertainty. 

Proposition 4: [9] Assume that the closed-loop system [P, C] is f-stable. Then, 
[Pi, C] is f-stable for all Pi such that 5 H2 (P, Pi) < b if and only if b < 6 PiC where 



Proposition 5: [43] Assume that the closed-loop system [P,C] is f-stable. Then, 
the following are equivalent: 

(i) 5n 2 (P n , P) -> as n -> oo. 

(ii) Hp njC is f-stable for sufficiently large n and ||H Pn)C - H PiC ||oo ^ as n ^ oo. 

Proposition [5] was the primary justification for the claim in [43] that the gap metric 
defines the "correct" topology for robustness of feedback systems. In the present context, 
it can be seen that the choice of a signal space with support on the positive half-line is 
essential in achieving an appropriate topology. To emphasize the point, if £ 2 [0, oo) were 
replaced by £ 2 (— oo, 0] then the above proposition would hold with f-stability replaced by 
b- stability. 

Let us consider the case where systems are defined on £ 2 (— oo, oo). Then we define 

5£ 2 (Pl> P 2) := ||ng Pi £2 -ng P2 £ J| 

where 

Qp t ,C2 Gi(s)C 2 

and £ 2 := ^2(— joo, joo). With this definition, Qv,c 2 ls always closed, but may contain 
"non-causal" input-output pairs (as pointed out in [8] — see also Section III-BI) . It is easy to 
construct examples to demonstrate that convergence of <5c 2 (P n ,P) to zero does not allow 
any closed-loop stability prediction, e.g., [P, C] f-stable does not imply [P n , C] f-stable for 
sufficiently large n. 

In [36] Vinnicombe introduced a new metric 5 V (-, •) on dynamical systems which defines 
the same topology as 5n 2 (-, •)» an( ^ wn i cn satisfies the following inequality: 

Wv) <£,(•,•)< <Wv)- 
The v-gap between P 1 and P 2 is defined as follows: 

f ^(Px.Pa) if 

^(P 1; P 2 ) := I wno(det(G 2 (- S ) T G 1 ( S ))) =0, (11) 
I 1 otherwise, 

where wno(<?(s)) denotes the winding number about the origin of g(s), as s traces the 
standard Nyquist D-contour [36], [37]. A simple expression for 5c 2 (-, ■) can be obtained using 
left fractional representations — let Pi(s) = (m i (s))~ 1 h i (s) be a left-coprime polynomial 
fraction, d{ the Hurwitz polynomial matrices which satisfy 

fk{s) (hi(-s)) T + rhi(s) (rhi(-s)) T = d i (s)(d i ( y -s)) T , 

and define 

Gi(s) := {-(diis))- 1 ^), {diis))- 1 ™^)) 

for i = 1,2. The graph of Pj is the kernel of multiplication by Gi(s) (in the respective 
space of signals H 2 or ^ 2 ). The £ 2 -gap can now be expressed as 

5 £2 (Pi,P 2 ) := \\G 2 (s)G 1 (s)\\ 0O . 



It turns out that Propositions |4] and [5] both hold with 5u 2 replaced by 5 V (see [36]). Since 
3c 2 — $v when 

wno(det(G 2 (-s) T G'i(s))) = (12) 

holds, this condition effectively imposes a positive time-arrow on the double-axis graph 
which forces f-stability to be retained under small perturbations in 5 V (-, •). This is illustrated 
by the following result (which can be readily derived from [36, Theorem 4.2]; see also [10]). 

Proposition 6: Let [P, C] be f-stable and suppose S C2 (P n , P) — > as n -> oo. Then 
[P n , C] is f-stable for all sufficiently large n if and only if wno(det(G n (-s) T G(s))) = 
for all sufficiently large n. 



B. The effect of the time-arrow on gap distances 

We define a forward and a backward v-gap as follows, 

5„ j/ (P 1 ,P 2 ) := <f„(Pi,P 2 ) 

^ 6 (Px,P 2 ) := ^(a(Pi),a(p 2 )). 

It is straightforward to see that 

<y £2 (Pi,p 2 ) = 5£ 2 (a(Pi),a(p 2 )), 

so any difference between 5 v j(P\, P 2 ) and S v> b(Pi, P 2 ) lies in the winding number condition 
in (fTTI) . Let us examine this more closely. Note that 

det(G 2 (- S ) T G 1 ( S )) - ^ (S) 



det(d 2 (-s))det(cii(s)) 
where 

h(s) := det(m 2 (— s) T mx(s) + n 2 (— s) T ni(s)). (13) 

If 5c 2 (Pi, P 2 ) < 1 then it can be shown that wno(det(G 2 (— s) T Gi(s))) is well-defined [36], 
in which case h(s) admits a canonical factorization 

h(s) = h+{s)h-{s) (14) 

where h + (s) and h-(—s) are Hurwitz polynomials. Thus, wno(det(G 2 (— s) T Gi(s))) = if 
and only if 

deg(/i + (s)) = deg(det(di(s))), 

or equivalently 

deg(A_(s)) = deg(det(d 2 (s))) > 

It can be shown that the degree of det(c£j(s)) is equal to the McMillan degree of Pj (e.g. 
using the uniqueness of normalised coprime factors over up to a constant unitary 
transformation and the corresponding state-space realisations [26], [35], [44]). Determining 
the graph symbol for 3(P«) requires a canonical factorization 

(ni(s)) T rii(-s) + (nii(s)) T nii(-s) = (di(-s)) T di(s) 



with det(c?j(s)) a Hurwitz polynomial. Again it can be shown that the degree of det(c?j(s)) 
is equal to the McMillan degree of Pj. The corresponding winding number condition in 
5„ j6 (Pi, P 2 ) can now be expressed as 

wno(det((rf 2 (-s)" 1 )/i(-s)(J 1 (s)- 1 ))) = 

which is equivalent to deg(/t_(s)) being equal to the McMillan degree of Pi. We therefore 
obtain the following result. 

Proposition 7: Let Pj(s) {i = 1,2) be the rational transfer functions of linear time- 
invariant dynamical systems as above, with McMillan degrees and with h,h + ,h^ 
as in H13H14[) . Assume that 8 C2 (P U ~P 2 ) < 1. 

1) The following are equivalent: 

a) <M p i, p 2) < 1. 

b) deg(/i + (s)) = /ii, 

c) deg(/i_(s)) = // 2 . 

2) The following are equivalent: 

a) <y Wi6 (Pi,P 2 ) < !. 

b) deg(/i_(s)) = nx, 

c) deg(/i+(s)) = // 2 . 

3) The following are equivalent: 

a) ^ / (P 1 ,P 2 ) = ^ j6 (P 1) P 2 ) < 1, 

b) /ii = yu 2 = deg(/i + (s)) = deg(/i_(s)). 

In the above proposition, ij expresses the zero winding number condition in (fTTT) in 
an equivalent form, while 2) does the same for (^(Pi, P 2 ). It is interesting that when 
the two conditions are combined as in 3) the result is a very stringent requirement which 
includes the necessity that P\(s) and P 2 (s) have the same McMillan degree. This serves to 
highlight the fact that "unmodelled dynamics" which may account for a small error in 5 v j 
(and which may be neglected in the design of a robust controller) will inevitably account 
for a substantial error in 5 V: b- 

Example 8: Consider two systems with different McMillan degrees, e.g. P\{s) = 1, 
P 2 (s) = 1/s. It can be computed that <5„j(Pi,P 2 ) = l/y/2. Proposition [7] then tells us 
immediately that ^^(Pi, P 3 ) = 1 where Ps(s) = —1/s. Similarly, if Pi(s) = 1/s 2 , then 
Proposition [7] tells us that both <5„j(Pi, P 4 ) = <5^ i fe(Pi,P 4 ) = 1 since Pi(s) = P 4 (— s). 



V. Time-asymmetry and robust control 

This section addresses the implications of the time-asymmetry in the theory of robust 
control. In particular, we will also see that a system which is "easy" to control in one 
direction of time may be far from easy to control in the opposite direction. 



A. Optimal robustness and difficulty of control 

In [11] it was shown that bp c could be maximised over all stabilising C and that this 
amounts to solving a Nehari problem [44]. This optimum value, which we denote by 

&opt,f(P), 

can be interpreted as a measure of ease/difficulty of control, where a value near to 1 means 
the plant is "easy to control" and a value near means the plant is "hard to control". 

With the understanding that & optjf (P) has the meaning of "ease of control" with respect 
to the forward time-arrow for stabilty, it is interesting to define 



AiptM-T ) ■— «opt,f 



(3(P)), 



which represents "ease of control" with respect to the backwards time-arrow. Our main 
purpose in defining 6 pt,b(P) is to highlight the influence of the time-arrow in feedback 
regulation. 

Let P be a controllable and observable system which is described by the state-space 
equations in CQIU), as before. Then, following [11], [9], 

6opt,f(P) = a/1 - A max (r+x + ) 
where Y + is the positive definite solution of the Riccati equation 

A Y + YA* - YCR~ X C*Y 

+B(I - D*R- 1 D*)B* = (15) 

where A = A — BD*R~ 1 C and R = I + DD*, and X + is the corresponding solution to 
the (F-dependent) Lyapunov equation 

(A - YC^R^Cyx + X{A - YC'BT^C) 

+C*R- 1 C = (16) 

for Y = Y + . Similarly, it can be seen that 

&o P t,b(P) = v 7 ! " A max (YlX_) (17) 



where Y- is the negative definite solution of the Riccati equation (U51) while X_ is the 
corresponding solution to (fT6l) for Y = YL. 

In the following two examples we will see situations where 6 O pt,f(0 and 6 O pt,b(0 arc 
very different. 

Example 9: Near pole-zero cancellations. 

Consider P(s) = 1 + Letting -A = C = D = 1 and B = e in equations (fl5 |) - ([T7T) 
gives 



J op t,f 



(P) 



'I + e + e 2 /2- 1 -e/2 
2 x /l + e + e 2 /2 



It follows that for small values of e, 



6 op t,f(P) = 1 



1 



e 2 + 0(e 3 ) 



32 



and hence, 6 opt f (P) — ► 1 as e — ► 0. On the other hand, 




which leads to 



WP) = 7N+0(e 2 ) 



for small values of e, and hence 6 op t,b(P) —> as e — > 0. This is accounted for by the fact 
that P(s) has a near pole-zero cancellation in the LHP, which is innocuous for f-stabilisation, 
but highly challenging for b-stabilisation. The latter is equivalent to f-stabilisation of P(—s), 
which has a troublesome near pole-zero cancellation in the RHP. 

Example 10: Riding Bicycles. 

A feedback stability problem in everyday experience is bicycle riding. An elementary 
model to study rider-bicycle stability is given in [2] which gives the following transfer 
function from steering angle input to tilt angle: 



where a,/3, 7 are positive constants and V is the forward speed. This model has one RHP 
pole, but the zero is in the LHP. As such, this plant is not too difficult to control. 

Let us consider what happens if we try to ride the bicycle backwards in time. This 
corresponds to trying to stabilise the plant P(—s) forwards in time. The model still has one 
RHP pole, but the zero is also in the RHP, which makes stabilisation much more difficult. 
Indeed if V(3 = ^7 the plant is technically not stabilisable. It is interesting to note that an 
experimental bicycle with the steered wheel at the rear instead of the front has a transfer 
function from steering angle input to tilt angle given by [2] (see also [21]) 



This is exactly the transfer function for the conventional bicycle ridden backwards in time. 

Figure [2] shows the value of 6 pt,f an d b op t,b versus V with parameter values a = 1/3, 
(5 — 2 and 7 = 9 (which are deemed reasonably realistic). Recall that 6 op t,b is the same as 
&opt,f f° r the rear- wheel steered bicycle model (TT9b at the same V . It can be observed that 
fropt,b is less than 6 op t,f for any V. Also, 5 op t,b is very small for low V, indicating difficulty 
of control, and zero at V = 1.5 m/s. For larger V, & op t,b increases, indicating that control 
becomes easier. These results are equivalent to the rear-wheeled steered bicycle being more 
difficult to ride than the front-wheel steered one, but still being reasonably controllable at 
higher speeds [2]. 



aV 




(18) 



aV 



s + f3V 
s 2 — 7 



(19) 



V in m/s 

Fig. 2. 6 op t,f and 6 op t,b versus V for the bicycle model of d!8t with a = 1/3, /3 = 2, and 7 = 9. 

5. Tzme irreversible feedback phenomena 

The concept of ease or difficulty of control gives a thought-provoking perspective on 
reversibility. Systems which in a limiting situation are very difficult to control (in the sense 
that & pt,f (P) tends to zero) are unlikely to be observed in nature or technology. Nevertheless, 
such a system may be easy to control in the time-reversed direction (see Examples [9] and 
[TOl) . This is independent of the fact that the underlying differential equation can be integrated 
equally well in either time-direction. This is reminiscent of phenomena (such as a bottle 
falling from the table and shattering into many pieces) that appear to be associated with 
an intrinsic direction of time even though classical physics would also allow the reversed 
motion as a solution (see Section [VI] for a further discussion). 

We expand this point in the context of Example [TOl The loss of stabilizability of the rear- 
wheeled steered bicycle at V = 1/7//? has the following interesting consequence. Imagine 
a video of a rear-wheeled steered bicycle being ridden stably at this critical speed. Let us 
assume that it is possible to verify from the video the actual speed (e.g., by knowing the 
frame-rate and observing markings on the ground). An observer with a good grounding in 
control theory would be led to the inescapable conclusion that the video had been made 
when the said bicycle was actually being ridden backwards in space (i.e., with a negative 
V) and then played backwards in time as well, giving the impression of a forward motion. 

VI. The arrow of time in physics 

The subject of the "arrow of time" is a well-known conundrum in physics. The second 
law of thermodynamics states that the entropy of a system increases with time. It is the 
time- asymmetry in this law which gives rise to the notion of the "thermodynamic arrow of 
time". The classical derivation of the second law in statistical mechanics due to Boltzmann is 
connected with a famous puzzle known as Loschmidt's paradox [40]. This essentially points 
out that the laws of mechanics used in the derivation of the second law are time- symmetric 
whereas the conclusion is not. Evidently the time-asymmetry creeps in through the statistical 



assumptions. An illuminating discussion of this issue is given in [29]. Other arrows of time 
have also been defined, for example (i) the "psychological arrow" — the direction in which 
time passes as perceived by a sentient being [14], [33], (ii) the "cosmological arrow" — 
the direction of time in which the universe is expanding. Hawking [14] argues that the 
thermodynamic and psychological arrows are always aligned with each other but these need 
not always be aligned with the cosmological arrow (though they are at present). 

In this paper we have described the time- asymmetry in the definition of control systems 
stability as a time-arrow. In the theory of dynamical systems there is also the notion of 
passivity, which again defines a time-arrow. For electrical circuits the time-arrow of passivity 
can be seen in the behaviour of the resistor, in contrast to the inductor and capacitor which 
are time- symmetric in their operation. If the electrical resistor were to operate backwards 
in time one would observe a resistor gathering low-grade heat from the environment and 
charging up a battery. This behaviour would be recognised as a violation of the second 
law of thermodynamics (see [20, pages 260, 390-2]). In a similar way, an ideal linear 
damper operating backwards in time extracts low-grade heat from the environment to create 
mechanical work, in violation of the second law. It seems that the arrow of time in passive 
systems or circuits coincides with, or is the same as, the thermodynamic arrow. 

How does the arrow of time for control system stability relate to other time arrows? It 
is highly unlikely that a control engineer who is designing a control system for a plant will 
give even a moment's thought to the preferred time arrow for control. Without expressing 
the thought, the designer will seek decaying free motion solutions in the direction in which 
time is perceived to be passing. In this way the arrow of time for control could be said 
to coincide with the psychological arrow. On the other hand, in biological systems, active 
control is ubiquitous. It is less obvious that, for example, homeostasis in a cell is aligned 
with the psychological arrow. Here we will be content to raise the question of whether the 
stability arrow for control systems in general can be directly related to the thermodynamic 
arrow, e.g. by considering information flow or the effect of internal energy sources. 

Finally, from a purely mathematical point of view, we observe that the arrow of time for 
control systems stability appears identical with the arrow of time for passivity. This supports 
the conclusion that the arrow of time for control systems stability always coincides with the 
thermodynamic (and psychological) arrow. 

VII. Feedback loops and time delays 

Let D T : x(t) h- > x(t — r) denote a time delay operator. It seems superfluous to 
say that D T is physically realisable for r > 0. Indeed the delay is a common feature of 
communication and control systems. For r < 0, D T is the ideal predictor which is not 
believed to be physically realisable as a "real-time" device. At first sight this "fact" appears 
to be self-evident, but its subtlety is revealed on closer examination — indeed, a rigorous 
justification appears not to be available at present. An insightful discussion of the issue of 
"causation" and its connection with the arrow of time is given in Price [29, Chapter 6]. Price's 
suggestion that the asymmetry of causation "is a projection of our own temporal asymmetry 
as agents in the world" [29, page 264] is similar to the view expressed by Bertrand Russell: 
"The law of causality, I believe, like much that passes muster among philosophers, is a relic 



of a bygone age, surviving, like the monarchy, only because it is erroneously supposed to do 
no harm". This prevalence in physics and philosophy of an anthropocentric explanation of 
causation sits in opposition to the belief of the unrealizability of a "prediction machine" out 
of physical components and processes, and suggests that a deeper analysis of the question 
is needed. 

In this paper we will not attempt to further debate the origin and explanation of causation. 
In the next section we will simply highlight the striking difference in behaviour of feedback 
loops with small delays versus predictors and confirm the difference using the forward-time 
gap metric. 

A. Feedback stability, delays and predictors 

Consider a feedback system which consists of an integrator in series with a time delay 
and negative unity feedback. The governing equation is 



where d(t) denotes an external disturbance. We set d(t) = and consider the totality of 
all free motion solutions of the system equations. If all solutions decay as t — > +00 we 
say the system is f-stable. This definition agrees with the one given in Section IIII-AI for 
finite-dimensional systems. 

For r > we can verify that (l20l) is f-stable. Taking Laplace transforms in (1201) gives 



We can verify that all zeros of s + e ST = are in the LHP so the system is f-stable. 

Now consider the case where r < 0. Note that this corresponds to an integrator with a 
predictor in negative feedback, which we would not expect to be realizable in the forward 
time direction. In fact, s + e~ ST has infinitely many zeros in the RHP for any r < and 
hence the system fails to be f-stable. It is evident that the system displays a discontinuity 
in the asymptotic (as t — > 00) behaviour of the free motion at the point r = 0. 

Let us now consider the closeness of the systems involved using the v-gap metric. Let 
P denote the integrator and P r denote the integrator in series with D r . Regarding these as 
operators on £2(0, 00) we have the graph: 



x(t) + x(t - t) = d(t) 



(20) 



1 



d(s). 



s + e 




for r > 0. Then 



s 



(l-e-)ll 



00 



which tends to zero as r 



0. Also 



so providing |t| < n there are no crossings of the negative real axis of this function when 
s = juj. Hence, wno(G2(— s) T G\(s)) = 0, for r sufficiently small. This implies 



for t > and sufficiently small and 8 v j(P, P T ) — > as r — > 0. 

Now consider the case of P T with r < 0. Again regarding P r as an operator on £2(0, 00) 



which behaves like e ST for large s, so the winding number of this function is not zero and 
(J t)i/ (P,P T ) = lforr<0. 

The above analysis with the gap agrees with the earlier conclusion on f-stability. For 
t > 0, f-stability was retained for sufficiently small r, but lost for any r < 0. Now we have 
seen that, as long as t > 0, there is a small error in 5 v j, but for any r < 0, 5 v j(P, P T ) = 1. 

Finally, it is interesting to mention that the tolerance of feedback loops to small time- 
delays is guaranteed by a well-known sufficient condition that the high-frequency loop-gain 
of the feedback loop is smaller than one ([3], [6], [39], [42]) — a condition routinely met in 
practice. It is easy to check that robustness to an arbitrarily small "parasitic predictor" in the 
loop would be guaranteed theoretically by the loop-gain being greater than one at arbitrarily 
high frequencies — a condition that appears impossible to achieve in a real feedback system. 



1) Stability is a time-asymmetric concept. The requirement of an asymptotic property as 
t tends to PLUS infinity defines a time arrow. 

2) A stability definition which requires bounded outputs in response to bounded inputs 
does not obviously imply a time arrow. For signal spaces with support on a positive 
(resp. negative) half-line, the definition turns out to imply a positive (resp. negative) 
time arrow. 

3) A bounded-input bounded-output definition of stability for signals with support on 
the doubly-infinite time-axis does not define a preferred time arrow. Stable systems 
defined by bounded multiplication operators may be stable in the sense of Lyapunov 
in the positive time direction, in the negative time direction or in neither direction. 

4) The fact that the closure of the graph of an unstable causal system may coincide with 
the graph of a stable anti-causal system on the doubly-infinite time-axis need not be 
a fundamental obstacle in developing a usable control theory on the doubly-infinite 
time-axis. 

5) Any method which modifies the BIBO definition of stability on the doubly-infinite 
time-axis to agree with conventional stability notions could be interpreted as the 
imposition of a positive time-arrow. 



< 5 B>/ (P,P T ) = ( 5 jCa (P,P T ) 



we have: 




and5 £2 (P,P T ) 




VIII. Synopsis 



6) A time-conjugation operator on systems was defined as well as the concepts of f- 
stability and b-stability. 

7) Both the finite-horizon and infinite-horizon quadratic regulators give a different optimal 
cost for a system running forwards in time and backwards in time. In the infinite 
horizon case the optimal cost can be expressed in terms of the two extremal solutions 
of the appropriate algebraic Riccati equation. 

8) The role of the positive time arrow in the gap metric measure of uncertainty for 
dynamical systems was highlighted. The usual 7i 2 -gap metric inherits the positive 
time arrow by virtue of systems being defined as operators on the positive half-line. 
The £ 2 -gap metric, which is well known to define an inappropriate topology for robust 
control, does not have a preferred time-direction due to the underlying operators being 
defined on the double-axis. The v-gap metric may be interpreted as the C 2 -gap with 
an imposed time-arrow. 

9) A time-conjugated v-gap metric was defined to measure closeness for robust b-stabilisation. 
It was seen that closeness of systems in the forward and backwards directions is a 
strong condition which includes the requirement of equal McMillan degrees. 

10) It was seen that ease or difficulty of control as measured by optimal robustness in the 
gap metric is a property that depends on the time-arrow. 

11) The situation of a plant which is easy to control in one time-direction but impossible 
to control in the other shows that irreversibility can be intimately related to control. 

12) An engineering perspective of control suggests a close link between the control system 
stability arrow and the psychological arrow. Unified mathematical frameworks for 
passive circuits and feedback control suggest a close link between the control system 
stability arrow and the thermodynamic arrow. The question was raised whether the 
stability arrow for control systems can be directly related to the thermodynamic arrow. 

13) The issue of the non-realizability of the pure predictor as a "real-time" device and 
the connection with the arrow of time was highlighted as well as the difficulty of es- 
tablishing non-realizability rigorously. The strongly contrasting behaviour of feedback 
loops in the presence of arbitrarily small time-delays or predictors was pointed out. 
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